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ABSTRACT 

Attempts  have  been  made  to  distinguish  between  visual  mechanisms  as  being  space  domain  fea¬ 
ture  detectors  or  transform  domain  spatial  frequency  detectors.  Those  descriptions  can  be 
considered  equivalent  in  terms  of  producing  similar  filtered  images.  However,  Fourier-like 
transformation  appears  to  be  a  more  accurate  description  of  the  filtering  process  when  bio¬ 
logical  data  is  considered.  -- 


INTRODUCTION 


There  is  still  no  generally  accepted  conceptual  or  physical  framework  with  which  to  under¬ 
stand  spatial  vision.  Few  would  argue  the  importance  of  receptive  fields  of  neurons  as 
the  prime  mechanisms  of  vision.  Much  research  has  attempted  to  determine  if  receptive 
fields  are  either  feature  or  spatial  frequency  detectors.  It  is  suggested  that  such  a 
dichotomy  is  largely  semantic  and  masks  the  true  problem,  how  to  quantitatively  describe 
those  mechanisms  of  spatial  vision.  The  answer  will  most  likely  come  from  an  understanding 
of  the  function  of  neuronal  receptive  fields. 

The  function  of  neurons  in  the  visual  system  is  to  process  selective  information,  that  is, 
filter  information  along  certain  stimuli  dimensions.  For  example,  there  are  three  mechanisms 
selective  to  only  one  color:  red,  green,  blue.  These  mechanisms  are  called  color  channels, 
not  color  feature  detectors.  If  the  function  of  these  mechanisms  is  to  filter  color,  then 
the  relevant  stimulus  dimensions  to  relate  to  these  mechanisms  is  wavelength,  hue,  and  in¬ 
tensity.  This  same  view  is  suggested  for  mechanisms  in  the  visual  system  that  process  spa¬ 
tial  information.  Rather  than  attempt  to  distinguish  particular  semantic  differences  of 
names  for  spatial  mechanisms,  perhaps  a  more  relevant  approach  to  their  function  is  in  terms 
of  filtering. 

The  relevant  properties  of  a  filter  is  its  bandwidth  (the  range  of  stimulus  dimension  that 
the  filter  is  maximally  tuned  to),  center  frequency  (a  reference  point,  usually  the  fre¬ 
quency  of  the  maximum  response  of  the  filter  bandwidth),  and  the  weighting  function  (the 
shape  of  the  filter,  the  way  it  attenuates  the  information  over  the  bandwidth). 

This  paper  will  describe  certain  aspects  of  two  different  filtering  processes.  Those  pro¬ 
cesses  will  be  related  to  known  biological  mechanisms  to  help  determine  the  most  relevant 
way  of  describing  spatial  filtering  in  vision.  The  general  filtering  properties  of  certain 
mechanisms  will  suggest  a  common  transformation  of  spatial  information  that  can  be  used  to 
describe  their  individual  function  and  their  collective  process.  The  relevance  of  this  ap¬ 
proach  will  be  discussed  in  terms  of  quantifying  visual  perception. 

THE  SPATIAL  FILTERING  PROCESS 

The  term  spatial  filtering  is  commonly  used  to  describe  the  process  of  attenuating  the  fre¬ 
quency  spectrum  of  an  image.  Here  that  definition  is  extended  to  describe  the  attenuation 
of  spatial  size  or  frequency  using  either  space  or  frequency  domain  techniques.  There  are 
two  general  spatial  filter  processes:  convolution  and  spectral  attenuation.  Both  processes 
will  be  discussed  and  related  to  known  visual  mechanisms  in  an  attempt  to  determine  which 
process  can  best  provide  a  model  for  biological  spatial  filtering.  One  dimensional  analysis 
will  be  used  to  simplify  the  discussion  with  no  loss  of  generality. 

Convolution  is  a  space  domain  filtering  process  given  by 
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w  11 
pf(nx)  =  Z  p1(nx-mx  )  h(nx  ) 


In  words,  pattern  p^  is  rotated,  that  is,  folded  about  pattern  h  and  shifted  in  increments 
of  mx1  over  pattern  h.  These  patterns  are  multiplied  after  each  shift.  The  result  is  a 
filtered  image  pf  (nx).  If  either  pattern  is  symmetric,  then  the  folding  process  can  be 
ignored.  It  should  be  noted  that  the  process  of  convolution  is  identical  to  correlation  if 
there  is  no  folding. 

Filtering  using  spectral  attenuation  requires  the  transformation  of  the  input  pattern  into 
the  frequency  domain.  Fourier  transformation  is  the  most  common  process  used.  (However, 
there  are  many  other  transformations  that  could  be  used  for  similar  filtering.  Discussion 
of  these  other  transforms  are  beyond  the  scope  of  this  paper  and  the  term  Fourier-like  in¬ 
dicates  the  general  transform  process.)  Fourier  transformation  is  given  by: 

N  /nxv 

P(nx)  -  Z  p(nx)  e”  IT  (2) 

n=o 

/nx, 

where p(nx)  is  the  input  pattern  and  e~°  are  complex  exponentials.  The  complex 

exponentials  can  be  shown  to  be  equivalent  to  complex  sinusoids  using  Eulers  identity.  Thus 
eqn.  2  becomes: 

N 

p(nx)  =  Z  p(hx)  (cos  2tt(^-)  “  J  sin  2tt(^)]  (3) 

n*l  -  — 

In  words,  the  Fourier  transform  requires  that  a  pattern  p(nx)  be  multiplied  by  sine  and  cosine 
patterns  of  different  spatial  frequencies,  resulting  in  two  pieces  of  information.  The  co¬ 
sine  term  is  an  even  symmetric • function,  the  real  (Re)  part  of  the  Fourier  transform.  The 
sine  term  is  an  odd  symmetric  function,  the  imaginary  (Im)  part  of  the  Fourier  transform. 

The  magnitude  of  the  Fourier  coefficants  used  to  determine  the  Fourier  spectrum  is  given  by 


M  =  V  Re  +  Im  (4) 

n  n  n 

and  phase,  the  relative  position  between  the  cosine  and  sine  functions,  is  determined  from 


The  Fourier  transform  process  described  so  far  has  neither  gained  nor  lost  any  information 
about  the  input  pattern.  It  has  transformed  the  spatial  information  of  the  pattern  into 
another  domain.  The  result  of  filtering  the  input  pattern  by  these  sinusoids  can  be  related 
to  the  amount  of  size  or  periodicity  of  spatial  information  in  the  original  pattern.  If 
X  is  the  spatial  period  or  size  of  a  particular  feature,  then  xn  =  n/X  is  the  spatial  fre¬ 
quency.  Periodicity  is  the  angular  spatial  frequency  given  by  kn  *  2iTXn.  Thus,  x  is 
cycles  per  unit  length  and  k  is  periods  per  unit  length.  The  fundamental  spatial  frequency 
and  periodicity  is  given  by  xi  and  k^  respectively,  x^,  xg,  ...,  xn  and  ki»  k2»  *3»  •••» 
kn  are  the  harmonic  spatial  frequencies  and  periodicities  respectively. 

The  main  point  is  that  the  decomposition  of  the  original  pattern  by  the  sinusoids  is  itself 
&  filtering  process.  Further  filtering  is  accomplished  by  using  only  a  subset  of  the  trans¬ 
formed  data.  If  a  subset  of  that  data  is  retransformed  into  the  space  domain  using  an  in¬ 
verse  Fourier  transform,  the  result  will  be  a  filtered  image.  Passing  only  the  low  spatial 
frequencies  will  create  a  smoothed  image  as  shown  in  Figure  1,  fc  *  4.  If  only  the  high 
spatial  frequencies  are  passed,  an  image  having  only  edges  and  finer  details  result  as  shown 
in  Figure  1,  fc  *  64.  Thus,  by  passing  a  certain  portion  of  Fourier  components,  or  spatial 
information  resulting  from  filtering  an  object  with  periodic  patterns,  different  spatial 
information  that  can  be  related  to  the  size  of  features  that  make  up  the  original  object 
can  be  extracted  for  subsequent  use.  This  important  point  will  be  discussed  further  in  a 
later  section. 

CONVOLUTION  OR  FOURIER  TRANSFORMATION  AS  A  MODEL  OF  VISUAL  FILTERING  PROCESSES? 

Filtering  by  convolution  or  spectral  attenuation  can  produce  identical  results.  Indeed, 
there  is  a  theorem  in  Fourier  analysis  stating  that  the  convolution  of  two  functions  is 
identical  to  the  product  of  the  Fourier  transformed  functions.  Thus,  from  a  functional 
point  of  view,  neither  method  of  filtering  is  indistinguishable  from  the  other.  There  is, 
however,  a  difference  between  these  two  methods  in  how  they  perform  the  filtering. 
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The  Fourier  process,  unlike  convolution,  does  not  require  a  shift  of  the  decomposing  pattern 
(complex  sinusoids  in  the  case  of  the  Fourier  transform)  as  part  of  the  filtering  process. 

The  Fourier  process  does  its  filtering  using  stationary  patterns.  Unlike  convolution,  fil¬ 
tering  using  the  Fourier  technique  is  a  two-stage  process.  The  input  pattern  is  first  de¬ 
composed  or  filtered  using  limited  periodic  patterns  (limited  in  the  sense  of  using  a  dis¬ 
crete  Fourier  transform  having  finite  sinusoids,  more  relevant  to  biological  analogues  than 
the  infinite  sinusoids  of  the  continuous  Fourier  transform).  The  second  stage  of  filtering 
requires  that  certain  ranges  of  the  decomposed  pattern  be  extracted  over  certain  bandwidths 
(and  orientation  in  the  case  of  the  tvo-dimentional  Fourier  transform).  Note  that  there  is 
no  need  to  create  a  spectrum  from  these  d  ita  to  describe  the  process.  Only  the  transforma¬ 
tion  of  the  original  pattern  features  in-o  band-limited  ranges  of  spatial  information  relat¬ 
ed  to  periodicity  is  necessary  to  specify  the  Fourier  process.  The  result  is  a  filtered 
image  similar  to  that  produced  by  convolution. 

Which  process  is  more  meaningful  in  terms  of  our  current  knowledge  of  the  function  of 
certain  receptive  fields?  For  this  argument,  the  filtering  process  of  the  retina  and  LGN 
will  not  be  discussed.  The  major  decomposition  of  the  visual  scene  appears  to  take  place 
at  the  visual  cortex.  It  is  here  that  perhaps  a  distinction  can  be  made  between  convolution 
and  transformation  as  a  key  filtering  process  of  spatial  vision. 

Convolution,  if  used  by  biological  systems,  would  require  two  processes.  One  process  is 
needed  to  shift  the  image  of  the  input  pattern  over  the  filtering  mechanism,  the  receptive 
field,  or,  like  a  shift  register,  record  the  outputs  of  a  group  of  similar  sized  receptive 
fields,  located  along  the  extent  of  the  pattern  features.  There  is  no  evidence  for  biolog¬ 
ical  mechanisms  having  a  shift  process.  A  counter  argument  for  convolution  could  be  the 
necessity  of  eye  movements  to  mediate  spatial  vision:  it’s  the  function  of  eye  movements 
to  sweep  the  retinal  image  across  space  and  convolve  it  with  visual  receptive  fields.  How¬ 
ever,  the  magnitude  of  involutary  saccadic  eye  movement  is  about  12  minutes  of  arc.  Since 
this  would  mean  that  objects  whose  size  is  greater  than  12  minutes  of  arc  would  not  be  con¬ 
volved,  this  hypothesis  seems  untenable.  Voluntary  eye  movement  can  be  ruled  out  for  two 
reasons:  they  are  generally  random  and  objects  can  be  correctly  processed  under  tachisto- 
scopic  presentation  even  though  there  is  insufficient  time  for  voluntary  eye  movements. 

There  is  another  way  to  perform  filtering  using  convolution.  If  convolution  is  implimented 
by  rows  of  evenly  spaced,  adjacent,  possibly  overlapping,  receptive  fields  of  similar 
spatial  properties,  then  convolution  can  be  achieved  by  reading  out  the  summed  product  for 
each  group  of  similar  receptive  fields.  But  this  is  the  process  earlier  defined  for  Fourier 
transformation.  Therefore,  implimenting  convolution  using  in-place  filters  is  given  by  the 
mathematics  of  Fourier  transformation. 

A  Fourier  processes  requires  certain  additional  necessary  and  sufficient  conditions  be 
satisfied  before  it  can  be  considered  to  be  biologically  relevant.  First,  there  must  be 
evidence  for  receptive  fields  having  periodicity  over  a  range  of  periods  consistent  with 
the  size  of  objects  that  are  seen.  Second,  these  receptive  fields  must  have  even  and  odd 
symmetry,  i.e.,  have  cosine  and  sine  weighting  functions.  Third,  the  receptive  fields  must 
be  collected  over  certain  ranges  of  periodicity  for  subsequent  use.  Filtering  with  one 
extended  sinusoid  would  result  in  only  the  first  stage  of  a  Fourier  process,  not  particularly 
useful  by  itself.  Finally,  even  if  these  conditions  are  satisfied,  further  criteria  for 
use  is  simply  utility.  Is  this  method  of  describing  certain  visual  processes  useful?  Does 
it  provide  insight?  Each  point  will  be  discussed  in  turn. 

Hubei  and  Wiesel  (l)  found  receptive  fields  in  visual  cortex  maximally  tuned  to  two  general 
spatial  dimensions:  si2e  and  orientation.  The  general  weighting  function  of  these  rectan¬ 
gular  receptive  fields  are  periodic,  having  alternating  increasing  and  decreasing  weighting 
considered  to  be  regions  of  "excitation”  and  "inhibition.”  Although  Hubei  and  Wiesel  cells 
are  typically  represented  by  only  1  or  1.5  periods,  recently  Albrecht  (2)  has  shown  simple 
cells  having  over  U  periods.  Thus,  the  first  condition  of  a  Fourier-like  transform, 
periodicity  is  satisfied. 

The  next  condition  of  even  (cosine)  and  odd  (sine)  symmetric  receptive  fields  is  easily 
satisfied.  Receptive  fields  having  these  kinds  of  weighting,  shown  by  Hubei  and  Wiesel, 
have  been  characterized  as  line  and  edge  detectors.  Acknowledging  these  receptive  fields 
as  cosine  and  sine  filters  would  explicitly  imply  a  Fourier-like  process.  The  idealization 
of  these  periodic  receptive  fields  having  three  cycles  is  shown  in  two-dimensions  in 
Figure  2a  (cosine  on  top,  sine  on  bottom).  Figure  2b  shows  a  three-dimensional  view  of 
these  receptive  fields.  The  typical  schematic  of  the  Hubei  and  Wiesel  cells  is  shown  in 
the  upper  right  corner  of  their  respective  idealized  functions.  It  is  pointed  out  that 
these  idealized  receptive  fields  are  the  first  three  sine  and  cosine  terms  of  a  two-dimen¬ 
sional  discrete  Fourier  transform. 
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The  third  point  is  that  these  periodic  receptive  fields  be  used  in  clumps  collected  over 
space  having  limited  bandvidths.  Evidence  for  simple  cells  having  different  bandvidths  and 
different  center  frequencies  was  shown  by  DeValois  (3).  Bandwidths,  ranging  from  0.5  to  over  2 
octaves,  had  center  frequencies  that  covered  a  large  range  of  size. 

The  preceeding  considerations  suggest  that  Fourier-like  transformation  is  a  relevant  way  to 
describe  certain  visual  processes.  The  suggestion  of  sinusoidal  receptive  fields  is  ideal¬ 
ized.  A  more  exact  description  of  the  receptive  fields  would  cause  the  periodicity  to  be 
tapered  consinusoidally .  An  approximation  to  these  filters  would  be  Hermitian  polynomials 
(Young,  U).  However,  it  may  be  more  useful  at  this  stage  of  our  knowledge  to  pursue  the 
function  of  the  two  general  kinds  of  receptive  fields  that  exist.  Filtering  done  by  the 
author  using  many  filters  having  different  weighting  functions  suggests  that  the  particular 
shape  and  bandwidth  of  the  filters  do  not  critically  change  filtered  images  for  perhaps  80- 
90%  of  perceptual  phenomenon.  Exact  brightness  functions  and  lengths  of  features  will  be 
somewhat  dependent  on  the  particular  bandwidth  and  weighting  function. 

It  is  suggested  that  the  relevant  information  for  spatial  vision  is  (excluding  for  the  moment 
color,  motion,  etc.)  is  periodicity  (spatial  frequency ), orientation,  contrast  and  position  in 
the  visual  field.  Note  that  periodicity  is  substituted  for  size.  If  the  visual  system  were 
interested  only  in  filtering  out  size  information  in  objects,  then  the  extent  of  either  an 
even  or  odd  symmetric  field  would  need  be  only  one  cycle.  Similar  shaped  receptive  fields 
could  be  made  having  different  widths  to  cover  a  full  range  of  size.  But  many  receptive 
fields  have  been  found  to  have  over  b  cycles. 

Here  is  perhaps  the  best  distinction  that  can  be  made  to  determine  whether  receptive  fields 
are  encoding  size  or  another  stimulus  property.  Since  spatial  frequency  is  by  definition  a 
reciprocal  of  size,  that  description  does  not  help  any  argument  distinguish  between  size  and 
spatial  frequency  as  prime  stimulus  attributes  for  receptive  fields.  However,  what  does 
distinguish  between  a  size  or  other  measures  is  an  additional  property  found  in  certain  re¬ 
ceptive  fields  in  the  visual  system,  periodicity  or  the  number  of  cycles,  i.e.,  the  number  of 
bands  of  positive  and  negative  weighting  across  the  receptive  field.  It  would  seem,  there¬ 
fore,  that  the  function  of  the  receptive  field  may  be  best  described  by  the  periodicity  or 
number  of  cycles  per  unit  distance  that  occurs  across  the  receptive  field  rather  than  size. 

It  follows  that  the  function  of  certain  neurons  in  the  visual  system  is  to  perform  a  crude 
periodicity  or  frequency  analysis.  This  analysis  supports  the  suggestions  of  many  research¬ 
ers  who,  from  engineering,  neurophysiological,  and  psychophysical  considerations,  have 
suggested  that  the  visual  system  is  a  crude  Fourier  analyzer  (e.g.,  see  Ginsburg,  5). 

Agreement  between  the  mathematics  and  the  biological  process  still  does  not  mean  that  the 
visual  system  is  creating  a  two-dimensional  Fourier  transform  spectrum.  However,  the  biolog¬ 
ical  process  appears  to  be  taking  the  information  from  the  spatial  domain  and  decomposing  it 
into  a  form  analogous  to  a  two-dimensional  Fourier  transform  domain.  The  distinction  is 
important  because  in  the  former  case  one  expects  to  see  a  literal  spectrum  at  the  visual 
cortex.  This  is  not  necessary  in  the  latter  case.  Here  the  expectation  is  for  the  visual 
system  to  exhibit  certain  processing  characteristics  that  would  be  consistent  with  a  two- 
dimensional  Fourier-like  transform.  In  other  words,  it  would  be  expected  that  certain  visual 
phenomenon  exhibit  spatial  information  filtered  along  the  two  dimensions  that  are  processed 
by  a  Fourier-like  transform:  spatial  frequency  or  periodicity  and  orientation. 

IMPLICATIONS  FOR  SPATIAL  TRANSFORMATIONS  AND  PERCEPTION 

Are  there  any  perceptual  processes  that  suggest  processing  is  constrained  to  these  dimen¬ 
sions?  The  answer  is  yes  and  is  typified  by  the  perception  of  a  field  of  multistable 
triangles  such  as  those  shown  in  Figure  3a.  When  these  multistable  triangles  are  gazed  upon 
they  all  perceptually  group  together  in  one  of  three  orientations,  changing  shape  and 
appearing  in  depth  (Attneave,  6).  If  these  triangles  are  randomly  placed%  it  is  very 
difficult,  if  not  impossible,  to  make  one  of  these  triangles  flip  independently  of  the  other 
triangles.  Given  that  the  psychophysics  and  neurophysiology  suggests  that  the  spatial 
information  of  these  triangles  are  filtered  in  terms  of  spatial  frequency  and  orientation 
typified  by  the  Fourier  magnitude  spectrum  (Figure  3b),  what  would  happen  if  these  objects 
are  filtered  along  limited  ranges  of  those  two  dimensions.  This  is  shown  in  Fig.  3C.C1,  d, 
3*,  e,  e1.  Filtering  out  one  of  two  channels  in  each  of  three  possible  orientation 
combinations  demonstrates  this  perceptual  experience.  Perhaps  similar  filtering  is 
occurring  in  the  visual  system. 

The  preceeding  example  demonstrated  one  consequence  of  being  constrained  to  process  spatial 
information  in  a  Fourier-like  way.  The  next  example  demonstrates  the  kind  of  spatial 
information  that  can  be  extracted  from  band-limited  filter  mechanisms.  There  is  much 
evidence  to  suggest  that  the  visual  system  is  extracting  spatial  information  using  mecha¬ 
nisms  whose  bandwidths  are  about  1  to  2  octaves  in  spatial  frequency  and  ♦  15  degrees  in 
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orientation.  These  mechanisms  sure  called  channels,  a  name  coined  by  Campbell  and  Robson  (7). 
These  .channels,  if  used  for  object  recognition,  cannot  be  single  filters  such  as  a  receptive 
field  having  only  one  cycle  because  they  would  signal  similarly  for  dissimilar  objects. 
Channels  filtering  complex  spatial  information  need  to  be  created  from  a  range  of  smaller 
receptive  fields  to  provide  more  finely  sampled  spatial  information  and  phase  over  the 
channel  bandwidth. 


Bandwidtiis,  center  frequencies,  and  weighting  functions  relevant  to  object  recognition  can  be 
deduced  from  biological  data.  A  complex  object  such  as  a  portrait  requires  a  bandwidth 
greater  than  1  but  less  them  2  octaves  to  provide  sufficient  spatial  information  for  most 
visual  tasks  (Ginsburg,  5).  This  is  consistent  with  bandwidths  obtained  from  psychophysi¬ 
cal  and  neurophysiological  data.  Spatial  frequencies  closer  than  1  octave  appear  to 
interact  with  one  another,  therefore,  channel  independence  requires  that  they  be  separated 
by  about  1  octave.  The  weighting  function  for  channels  can  be  obtained  from  many  psychophys¬ 
ical  data  (e.g.  Blakemore  and  Campbell,  8;  Mostafavi  and  Sakrison,  9).  Further  consideration 
of  retinal  sampling  and  the  preceeding  analysis  suggests  a  maximum  of  7  channels  for  central 
vision  (Ginsburg,  5)* 

These  data  were  used  to  filter  a  portrait  as  shown  in  Figure  1.  The  numbers  beneath  each 
portrait  is  the  center  frequency,  fc,  for  each  channel.  The  center  frequency  referenced  to 
the  face  width  is  fc/2. 

Note  the  different  information  about  the  portrait  that  can  be  found  in  each  of  the  seven  two- 
octave  bandwidth  channels.  The  existence  of  an  object  is  evident  from  the  large  regions  of 
different  contrast  in  the  first  channel  (fc*l).  The  gross  elliptical  shape  of  an  object 
tnat  appears  right  side  up  is  seen  in  the  second  channel  (fc»2).  The  third  channel  (fc*M 
definitely  provides  enough  information  to  classify  the  object  as  a  face.  The  identification 
of  the  face  needs  a  little  more  information.  The  fourth  channel  (fc=8)  suggests  that  the 
face  is  that  of  a  woman  from  the  hair  style,  etc.  Identification  would  seen  possible  given 
a  limited  set  of  similarly  filtered  portraits  to  choose  from.  Clearly,  the  face  in  the 
fifth  channel  (fc®l6)  is  the  same  as  that  of  the  original  portrait,  and  identification  could 
present  no  problem  given  any  large  set  of  other  portraits.  If  information  about  the  details 
of  the  portrait  is  needed,  e.g.,  edges  and  lines,  the  hair  across  her  forehead,  the  texture 
of  her  hair,  size  of  the  pupils ,  outline  of  her  lips,  then  that  information  is  found  in  the 
two  remaining  channels  (fc*32,  6U).  The  last  channel,  fc*128,  has  virtually  no  usable 
information.  Thus,  these  seven  channels  create  a  hierarchy  of  filtered  images  that  provide 
the  full  spectrum  of  information  needed  for  any  perceptual  task. 

These  results  do  not  imply  that  all  of  spatial  vision  should  be  described  only  by  Fourier 
techniques.  There  are  certain  analysis  that  may  be  best  described  in  the  space  domain  of 
filtered  images.  For  example,  clustering,  symmetry,  and  figure-ground  analysis  using  images 
filtered  from  biological  data  of  complex  objects  have  been  demonstrated  by  Ginsburg  (5t  10). 
Recent  work  in  this  area  using  a  different  approach  to  texture  analysis  is  given  by  Marr, 
Palm,  and  Poggio  (ll).  Just  as  the  engineer  uses  convolution  and  Fourier  analysis  as 
dictated  by  the  problem  to  be  solved,  similar  criteria  is  suggested  for  helping  select  the 
techniques  for  analyzing  problems  in  vision. 

Biological  data  have  been  used  to  create  filters  that  describe  the  overall  filtering 
characteristics  of  the  visual  system.  Those  results  along  with  other  filtering  based  on 
channel  properties  provides  a  parsimonious,  quantitative  description  of  spatial  vision 
(Ginsburg,  5).  In  general,  the  overall  filtering  of  the  visual  system  limits  the  size  of 
objects  that  can  be  seen.  Further  filtering,  based  on  channels,  produces  filtered  images 
that  correspond  to  forms  that  are  perceived.  The  activity  of  all  the  channels  produces  the 
global  picture  that  is  seen. 

SUMMARY  AND  CONCLUSIONS 

1  The  two-dimensional  discrete  Fourier  transform  is  a  process  that  requires  the  input  object 
to  be  multiplied  by  a  bank  of  even  symmetric  cosinusoidal  and  odd  symmetric  sinusoidal  space 
functions  having  periodicity.  The  periodicity  corresponds  to  the  number  of  cycles  per 
receptive  field  width.  The  weighting  function  of  the  periodicity  is  alternating  positive 
and  negative  values.  These  properties  were  related  to  the  properties  of  cortical  receptive 
fields.  Certain  receptive  fields  may  functionally  perform  a  generalized  periodic  analysis 
fbr spatial  vision.  Since  these  functions  are  two-dimensional,  they  are  sensitive  to 
orientation,  similar  to  cortical  receptive  fields.  Therefore,  it  was  argued  that  the  visual 
system  contains  mechanisms  that  can  process  spatial  information  consistent  with  the  mathe¬ 
matical  process  of  a  two-dimensional  discrete  Fourier-like  transform.  This  analysis  of  the 
filtering  process  of  certain  visual  cortical  mechanisms  may  lead  to  further  under standing 
into  visual  perception. 
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